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ABSTRACT 


In this thesis we describe a new approach to characterize texture images at multiple scales 
using angular wavelet frames (AWF). We have constructed 2D AWF by applying frequency 
transformations to the prototype filters of a ID perfect reconstruction filter bank (PRFB) to 
obtain 2D filters. The objective of constructing AWF is to improve the performance of existing 
wavelet based methods for classification and segmentation of textures having dominant angular 
components. A texture is characterized by a set of channel variances estimated at the output of 
the corresponding 2D filter bank. Classification results for 26 Brodatz textures indicate that 
performance of the proposed AWF method is comparable to the existing wavelet based 
algorithms. The constructed AWF is also applied to perform supervised texture segmentation. 
We considered several images with different number of texture regions, and used the standard k 
- means clustering algorithm to integrate the feature images and produce segmentation of the 
original texture images. The performance of the supervised texture segmentation algorithm 
using AWF is found satisfactory. 
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Chapter 1 


INTRODUCTION 

1.1. An Overview 

Texture can be used in the analysis of images in several ways: in the segmentation of 
scenes into distinct objects and regions, in the classification or recognition of surface 
materials, and in the computation of surface shape. A perceptual surface texture may be 
informally defined to be a spatial pattern of local surface radiances giving rise to a perception 
of surface homogeneity. Within this context, an image texture may be defined as a local 
arrangement of image irradiances projected from a surface patch of perceptually 
homogeneous radiances. 

Textures provide important characteristics for surface and object identification from aerial 
or satellite photographs, biomedical images, and many other types of images. Their analysis 
is fundamental to many applications such as industrial monitoring of product quality, remote 
sensing of earth resources, and medical diagnosis. The analysis of texture requires the 
identification of those texture attributes, which can be used for segmentation, discrimination, 
recognition, or shape computation and the development of computational approaches for 
accomplishing these tasks. Texture segmentation involves partitioning a digital image into a 



2 


finite set of regions with uniform textures. This problem plays an important role in the field 
of image processing and computer vision. To design an effective algorithm for texture 
segmentation, it is essential to find a set of texture features with good discriminating power. 

1.2. Existing Methods 

There exists a variety of feature extraction methods in texture analysis such as gray level 
co-occurrence matrix (GLCM) [1], Laws’ texture energy (LAWS), second - order statistics. 
Gauss - Markov random field models, local linear transforms etc. They are restricted to the 
analysis of relatively small neighborhoods. As a consequence, their performance is best for 
the analysis and segmentation of the class of so-called micro textures. Recent models of 
human and mammalian vision suggest the existence of an internal spatial/frequency 
representation that is capable of preserving both local and global information. These findings 
have been the basis for several approaches to texture using bank of Gabor filters with 
different scale and orientation tuning [5], [6], [8], [9], [12]. 

The Gabor channel filters are complex sinusoidal (carrier) gratings modulated by 2D 
Gaussian functions in the space domain, and shifted Gaussians in the spatial frequency 
domain. The Gabor functions are complex valued functions on R , except when the carrier 
sinusoid has zero center frequency. In this case, the Gabor functions reduce to the 2D 
Gaussian functions. Gabor channel filters are well localized in space and frequency. This 
technique extracts features by filtering the texture image with a selected subset of a fixed 
Gabor filter bank. Several different criterions have been developed for the selection of the 
“right” subset of the filter bank, such as the filtered image energy [5] or the local maxima in 
the Fourier transform of the texture image [6]. The potential disadvantage of decompositions 
using the Gabor filter bank is that they are computationally quite intensive, especially for the 
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evaluation of low-frequency components. In addition, the outputs of Gabor filter banks are 
not mutually orthogonal, which may result in a significant correlation between texture 
features. Finally, if the center frequency of a selected Gabor filter does not match any of the 
important harmonics in the texture image it produces only “noisy” information and therefore, 
it degrades the segmentation performance. Most of these problems can be avoided by using 
wavelet transform. 

The wavelet transform methods provide a precise and unifying framework for the analysis 
and characterization of a signal at different scales [2]-[4], The use of a pyramid-structured 
wavelet transform for texture analysis was first suggested by Mallat [3]. This initial proposal 
has been followed by several studies on texture classification with a particular attention to the 
use of wavelet packets [7]. The algorithm developed by Chang and Kuo yields a texture 
specific wavelet tree stricture, which is used as a feature for classification. Unfortunately, 
this algorithm is not suitable for texture image segmentation because the extracted features 
are based on the wavelet tree structure of the entire image. However, each pixel of the 
images has to be classified and thus a feature vector for each pixel is needed. 

Unser [10] proposed an overcomplete wavelet decomposition, which he called Discrete 
Wavelet Frame (DWF), by incorporating redundant information to yield a translation 
invariant description of textures. The DWF feature extraction technique was used for texture 
discrimination and can also be incorporated into a multidimensional texture segmentation 
algorithm. These wavelet-based methods decompose the frequency plane into four 
rectangular segments. As a result, their analysis of textures with dominant angular 
component is rather poor. 
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His-chin Hsin [13] proposed modulated wavelet transform with a modulating frequency 
pair, which decomposes the image in the frequency region centered at the modulating 
frequency pair. The modulated wavelet transform zooms in the frequency region centered at 
the modulating frequencies for further decompositions. However, the selection of suitable 
modulating frequencies is a difficult task. 

1.3. Objective of Thesis 

Gabor channel filters are well localized in both space and frequency but they are 
computationally quite intensive. Also the outputs of the Gabor channels are not mutually 
orthogonal, which result in correlation between texture features. The wavelet-based methods 
cannot capture the dominant angular frequency information. To over come these problems, 
we propose the construction of angular wavelet frames (AWF). To capture dominant angular 
frequency information, we construct 2D angular wavelet frames using circular and angular 
transformations on ID prototype filter. The circular transformation on ID low pass filter 
gives a circularly symmetric low pass filter which has non zero value for a>i 2 +a> 2 2 < tt/2 and 
is zero for coi 2 +co 2 2 > tc/ 2. The angular transformation on ID low pass filter gives a 2D filter, 
which selects an angular region between 0 and n/2 in 2D positive frequency plane. We 
construct the 2D angular wavelet frames by using the combination of 2D circular and angular 
transformed filters. We also examine its performance for texture classification and 
segmentation. 

1.4. Organization of Thesis 

In chapter 1, a brief review of the existing methods used for the classification and 
segmentation of texture images is presented. 
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In chapter 2, we give the definition of the filter banks and discuss the main properties of 
the orthogonal wavelet transform in h (the space of square summable sequences). This 
approach is further extended to the case of an over complete representation, and the concept 
of an orthogonal (or tight) wavelet frame is introduced. Finally we explain the method of 
constructing 2D angular wavelet frames using frequency transformations. 

In chapter 3, texture classification and segmentation using 2D Angular Wavelet Frames is 
presented. Chapter 4 gives results obtained for texture classification and segmentation using 
Angular Wavelet Frames. 
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Chapter 2 

WAVELET TRANSFORMS AND FRAMES 

The wavelet transform is usually described as multiresolution decomposition for finite 
energy functions / of the continuous variable x (i .e.,/ (x) e L 2 ) [3]. In the present context, 
it is more appropriate to consider wavelet representations for discrete signals in I 2 (the 
space of square summable sequences), which was initially proposed by Rioul [14]. 
Section 2.1 gives the definition of the underlying filter banks and 2.2 provides a brief 
review of the main properties of the orthogonal wavelet transform in l 2 . In Section 2.3, 
this approach is extended to the case of an over complete representation, and the concept 
of an orthogonal (or tight) wavelet frame is introduced. In Section 2.4, 2D Angular 
Wavelet Frame design using frequency transformation is presented. 

2.1. Filter Bank Specification and Properties 

In this section, we characterize the perfect reconstruction filterbanks (PRFB) 
underlying the definition of the discrete wavelet transform [14], that we have used for our 
work. Wc start with the prototype filter li which satisfy the condition 
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H(z)H(z) + G(z) G(z ) = 1 (2.1) 

where H(z) is the z - transform of the low-pass filter h and g is its complementary high 
pass filter and is given by 

G{z) — zH(-z~ x ) (2.2) 

H(z) and G(z) are orthogonal or biorthogonal complements of filters H(z) and G(z) 
respectively. These prototype filters are then used to generate, in an iterative fashion, a 
sequence of filters of increasing width (indexed by i ) 

H M (z) = H(z 2 ')H i (z) 

G m (z) = G(z 2 ')//,.(z), (z = 0,l,...,/-l) 

with the initial condition Ho (z) = 1 . 

Equivalently, in the signal domain, we have the two - scale relation 

h l+l (k) = [h] t2l *h'(k) 

(2.4) 

g M (k) = [g] n ,*h i (k) 

where the notation denotes the upsampling by a factor of tn. The effect of one 

iteration is more or less to dilate the filters h i and g ( by a factor of two. Such a sequence 

of filters can be used to decompose a signal in subbands of approximately one octave 
each. The properties of such decomposition are best understood by analyzing the multi- 
band characteristics of the underlying filter bank. Some examples are shown in Fig. 1 . It 
can easily be shown that such sequences of filters also satisfy the identity 

H^H^ + YP^G^-) = 1 (2.5) 

k= 1 

and therefore, provide a full coverage of the frequency domain. 
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2.2. Orthogonal Wavelet Bases of l 2 

The sequence of discrete filters defined in the previous section 2.1 can be used to 
construct an orthogonal wavelet decomposition of h (Rioul). The discrete normalized 
basis functions can be defined as 

M*) = 2' 72 h t (k-2‘ l) (2.6) 

<M*> = T' 2 gi (k-Vl ) (2.7) 

Where i and / are the scale and translation indices, respectively; the factor 2' /2 is an 
inner product normalization. Now, we consider the sequence of nested subspaces 
l 2 =V 0 Z)V I zd...zdV / where V i =span {(p i t } leZ is the approximation space at resolution z . 

The embedding of these subspaces comes as a direct consequence of the two scale 

relations (2.4). We also introduce the subspaces W-, (z = 1, 2, . . . , I), where W t is the 
residue (or detail) space at resolution i and is defined as the orthogonal complement of 
V. with respect to V i _ x (i.e., Fj_, =V j + W i and V t It can be shown by induction 

that the families of sequences {<p,,/} /€Z and {i//, / } /eZ provide orthonormal bases of V { and 
W;, respectively. We can therefore evaluate the coordinates of the orthogonal projection 
of a discrete signal xe/ 2 in the spaces V i and W { by forming the inner product with 
corresponding basis function. The minimum h - norm approximation of x at scale i 
(orthogonal projection into V t ) is given by 

*«(*) = £*(l)(0<Pi,t ( 2 - 8 ) 

lez 


s (i) ( 0 = < x (k), V u (.k) >, 2 


(2.9) 
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where < •, • > denotes the standard h inner product and where <p ( . 0 (k) = 2' /2 h i (k) is the 

discrete scaling function at resolution /. The corresponding residue, which is the 
projection of x into W { , is given by the complementary wavelet expansion 

*(,•-» (*)-*(,#) = IX/OVO/ (2.10) 

lez 

d d) 0 ) = < x(k), y/. ; (A:) > ; _ 7 (2.11) 

where y /,• 0 (&) = 2 ,n g t (k) is the discrete wavelet at scale i. Finally by combining the 

residues over all scales down to a given depth I, we obtain the full discrete wavelet 
expansion of a signal xel 2 

x ( k ) = 2></> + EZ d o> (OV'o (2- 12) 

UZ 1=1 uz 

where d (i) 's are the wavelet coefficients and s (/) "s are the expansion coefficients of the 
coarser signal approximation x (/) . It also follows from this construction that the family of 
sequences {(p iJy V / u>V / 2 ,/>—>V / /,/}/ez constitutes an orthonormal basis of h. 

The analysis formulas 2.9 and 2.1 1 can be interpreted in terms of simple filtering and 
down-sampling operations. Specifically, we can write that 


s (I) (l) = 2 in [h] *x] i2 , (/) 

(2-13) 

d { 0 (0 = 2 in [gJ*x] l2l (l), 0 = 1 , 2 ,...,/) 

(2.14) 


where symbol T denoted the reversal operation (i.e., h T (k) = h (-k) and where [-] u is the down- 


sampling by a factor of m. 
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2.3. Discrete Wavelet Frames (DWF) 

A simple integer shift of the input signal will usually result in a non-trivial modification of 
the discrete wavelet transform. As far as feature extraction is concerned, this behavior is 
inadequate, for one usually thinks of “texture” as a translation-invariant (or stationary) property. 
A natural way to over come this limitation is to perform an analysis of the input signal in terms 
of the overcomplete family of templates 

' 5 = (2.15) 

which is similar to computing the discrete wavelet transform for all possible integer shifts of the 
input signal. This approach leads to the simple decomposition algorithm 

dr (k) = < gl (k-l),x(k) > h = gf *x(k) 

S? WF (k) = < h, (k-l),x(k) >, 2 = h] *x(k) (2.16) 

O' = 1,2,...,/) 

which is non-sampled version of Equation(2.13) and Equation(2.14). Unlike other wavelet-based 
approaches, this overcomplete wavelet decomposition (the discrete wavelet frame (DWF)), in 
which the output of the filter bank is not sub sampled, result in a texture description that is 
invariant with respect to translations of the input signal. This property yields a better estimation 
of texture statistics and a more detailed texture characterization at region boundaries. Because of 
the special structure of the analysis filter bank, this decomposition has a number of remarkable 
properties that are associated with the mathematical concept of a frame [2]. These include energy 
conservation and a particularly simple reconstruction algorithm. 
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The reconstruction formula 

x(k) = ]►>,(/)*,(*-/) + Y l '£d,V)g l (k-l) (2-17) 

/eZ » = 1 leZ 

which can also be interpreted in terms of a reconstruction filter bank. 

x(k) = h f *s I (k) + '£ gi * 4 (A) (2-18) 

1 = 1 

2.4. 2D - Angular Wavelet Frames (AWF) 

2.4.1. Extension of ID approach to 2D 

To extend the discrete wavelet frame (DWF) method to 2D, commonly used way is to use the 
standard tensor product formulation. This gives the decomposition, which is rectangular in nature 
in frequency domain as shown in Fig. 2. But our aim is to select dominant angular regions in the 
frequency plane. 

0)2 

COl 

Fig. 2. Frequency plane segmentation in 2D separable wavelet frame 
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In order to overcome this problem we suggest frequency transformation method for the 
purpose of extension to 2D. In conventional 2D filter design techniques, frequency 
transformation methods are used to construct a 2D filter from ID filter. We constructed 2D 
filters using two transformations - circular transformation and angular transformation on ID 
filters. Different ID prototype filters used in our work are c db2’, ‘db4’ and ‘dblO’. Now we 
describe the two transformation techniques here. Consider the transformation 

tf(©„© 2 ) = //(©) l =TM) (2.19) 


where r(©,,© 2 ) is the transformation function. The frequency response of the ID prototype 
filter H((o) and the transformation function are jointly chosen to design 2D filter H(o) i ,a> 2 ) 
with desired properties. 


The two transformations that we have used are: 

r(tt>, ,© 2 ) = -Jcof +(l >2 — circular transformation 

= 2 *atan(© 2 /©,) - angular transformation 


( 2 . 20 ) 


The circular transformation on ID low pass filter results in 2D circularly symmetric low pass 
filter. The low pass filters have significant response between -u/2 and tt/2. On applying the 2D 
circular transformation on the low pass filter, a value of the low pass filter at a given © is 
transferred onto a circle ©i 2 + © 2 2 = to 2 in the (®i, © 2 ) plane. So the resulting 2D filter has 
significant response for the region ©j 2 + co 2 2 < tu/2 and its magnitude beyond this region is very 
small. The frequency support of this 2D low pass filter is shown in Fig. 3. The region of support 
of this 2D filter is the shaded region in Fig. 3. The circular transformation on the corresponding 
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high pass filter results a 2D filter that has significant response for the region coj 2 + co 2 2 > nil and 
its magnitude in the region coj 2 + co 2 2 < nil is very small. The region of support of this 2D filter 
is the unshaded region in Fig. 3. 

The angular transformation on ID low pass filter gives a 2D angular filter. On applying the 
2D angular transfonnation on the low pass filter, a value of the low pass filter at a given co is 
transferred onto a straight line co 2 = coitan (co/2) in (co 1? co 2 ) plane. For 0 < co < nil, and for 
positive values of a>i, H (co) values are transferred to the angular region 0< 0 < n/4 and for 
negative values of coi they are transferred to the region n < 9 < 5n/4 of the frequency plane. For 
-7t/2 < co < 0, and for positive values of coi, H (co) values are transferred to the angular region 
-rc/4 < 0 < 0 and for negative values of ©i they are transferred to the region 3n/4 < 0 < n of the 
frequency plane. This 2D filter is hermitian symmetric. The frequency support of angularly 
transformed low pass filter is the shaded region in Fig. 4. The angular transformation of the 
corresponding high pass filter results in a 2D filter, which has frequency support in the unshaded 
region in Fig. 4. 



Fig. 3. Circular Transformation Fig. 4. Angular Transformation 
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2.4.2. Construction of 2D Angular Wavelet Frames 
2.4.2.I. Decomposition at Level 1 

Consider ID PRFB filters H((o ) and G(co) to construct a 2D angular wavelet frame based on 
ID wavelet frame. On applying the circular and angular transformations described in Equation 
(2.20) we obtain four filters H c , G c , H„ and G a . 


H e (co l ,co 2 ) = H(co)l_^ ! 
G>„© 2 ) = G(a>) 

H a (co i ,co 2 ) = H(a>) \ um3 *.^ /Ul) 

^ a l>® 2 ) ^(®) l£i) = 2*atan((o 2 /tt) l ) 


( 2 . 21 ) 


where the subscript V and ‘a’ denotes circular and angular transformations respectively. A 
useful wavelet frame can be constructed with the analysis filters 

F hh = H c H a ; Fh g = H c G a ; F gh = G c H a ; F gg = G c G a ; (2.22) 

The responses of these four filters using ID prototype filter ‘dblO’ are shown in Fig. 5. The 
Region of Supports (ROS) of these filters are shown in the Fig. 6. Fig. 6 (a) shows the ROS of 
HcHa, Fig. 6 (b) shows the ROS of HcGa, Fig. 6 (c) shows the ROS of GcHa, Fig. 6 (d) shows 
the ROS of GcGa. It is evident that the constructed 2D angular wavelet frames can capture the 
information in the regions as shown in the Fig. 5 and Fig. 6. At the synthesis end, the conjugate 
filters are constructed using conjugates of the ID filters in similar steps. The principle of this 
decomposition is illustrated in Fig. 7. This whole system of filters acts as the identity operator, 
i.e., the analysis and synthesis filters constitute a perfect reconstruction filter bank 
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GcHa 



“0 5 - 0.5 



(d) 

Fig. 5. Filter responses for level 1 using ‘dblO’ (c) GcHa (d) GcGa 
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Fig. 7. Single level Signal analysis and synthesis using the Angular Wavelet Frame decomposition 


Property: The 2D analysis filters constructed by circular and angular transformations of ID 
PRFB along with their conjugates at the synthesis end constitute a perfect reconstruction filter 
bank. 


Proof: From the Fig. 7, the output signal in terms of input can be written as 

S' = |>„ ?„ + F tg F ts + F„ F tt *F n F u ]s 
using eqn.(2.22) 

S' = [h, +H, +G, +G, G.Gfi u ]s 

= H,H t +G c G,\H.H,+G.G.]s 
= S (using PRFB condition eqn.(2. 1) 


Hence, the condition of perfect reconstruction for the designed filter bank is satisfied. 
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2.4.2.2. Decomposition at Higher Level 

Now we describe the method to decompose the signal at higher level. First consider the 
pyramidal decomposition where the low-low region is again decomposed into four regions. The 
set of ID prototype filters for the higher level of decomposition are obtained using the Equation 
(2.3). For level 2, using H 2 and G 2 (i=l) 2D circular and angular transformed filters H 2 C , G 2 C , 
H 2 a and G 2 a are obtained. 

Consider the following combination of analysis filters: 


F2hh - H2 c H 2a ; F 2hg - H 2c G 2a ; F 2g h = G2 C H2 a ; F2 gg = G2 C G2 a ; (2.23) 

These four filters decompose the region supported by Fhh ( HcHa ) into four regions. The 
frequency responses of these four filters are shown in Fig. 8. The Region of Support of these 
filters is shown in Fig. 12. These four filters along with H c G a , G c H a and G c G a are used for the 
second level of decomposition. At the synthesis end the conjugate filters are constructed using 
the conjugates of ID filters in similar steps. These seven filters along with their conjugates at the 
synthesis end constitute a perfect reconstruction filter bank. 

The output in terms of input can be written as: 

S' = [ F 2hh F Jhh +F 2 hg F 2hg + F 2gh F 2gh + F 2gg F 2gg + F hg F hg +F gh F gh + F gg F gg 


Using Equations (2.23), (2.3) and (2.1) we get 

S' = +F* +F lt F, +F a F„]s 


Using Equations (2.22) and (2.23) we get 
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S " = S 

Hence the condition of perfect reconstruction for the designed filter bank is satisfied. 

Using the pyramidal decomposition we obtain N = 1 + 31 (where I is the level of 
decomposition.) number of filters. 

We can as well decompose the other regions of the frequency plane each into four regions, 
which is called tree structured decomposition method. Using Equation (2.3) we can get 
decomposition in the low-low frequency region only. In order to decompose the other regions, 
we modified the Equation (2.3). We describe this method for the case of level 2. Consider the 
following four equations: 


HlH2(z) = H(z 2 )H(z) 

(2.24) 

H\G2(z) = G{z 2 )H(z) 

(2.25) 

G\H2(z) = H(z 2 )G(z) 

(2.26) 

GlG2(z) = G(z 2 )G(z) 

(2.27) 


The first two equations are same as Equation (2.3) for i=l. Using circular and angular 
transformations on these four ID filters we obtain 2D filters H1H2 C , H1G2 C , G1H2 C , G1G2 C , 
HlH2 a , HlG2 a , GlH2 a and GlG2 a . 

To decompose in the low-low region the set of filters to be used are: 

F, = H1H2 C HlH2 a ; F 2 = H1H2 C HlG2 a ; F 3 = H1G2 C HlH2 a ; F 4 = H1G2 C HlG2 a ; (2.28) 

These four filters are same as the filters given by Equation (2.23). Fi = F 2 hh; F 2 = F 2hg ; F3 = F 2gh ; 
F 4 = F 2gg ; 
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To decompose in the low-high region the set of filters to be used are: 

F 5 = H1H2 C GlG2 a ; F 6 = H1H2 C GlH2 a ; F 7 = H1G2 C GlG2 a ; F 8 = H1G2 C GlH2 a ; (2.29) 

The frequency responses of these four filters are shown in Fig. 9. The Region of Support of these 
filters is shown in Fig. 13. 

To decompose in the high-low region the set of filters to be used are: 

F 9 = G1G2 C HlH2 a ; F,p = G1G2 C HlG2 a ; F n = G1H2 C HlH2 a ; F 12 = G1H2 C HlG2 a ; (2.30) 

The frequency responses of these four filters are shown in Fig. 10. The Region of Support of 
these filters is shown in Fig. 14. 

To decompose in the high-high region the set of filters to be used are: 

F , 3 = G1G2 C GlG2 a ; F !4 = G1G2 C GlH2 a ; F 3 = G1H2 C GlG2 a ; F 2 = G1H2 C GlH2 a ; (2.31) 

The frequency responses of these four filters are shown in Fig. 1 1 . The Region of Support of 
these filters is shown in Fig. 15. 

At the synthesis end the conjugate filters are constructed using the conjugates of ID filters in 
similar steps. The filter bank designed for the analysis along with their conjugates at the 
synthesis end constitutes a perfect reconstruction filter bank. In the next chapter, the application 
of these 2D Angular Wavelet Frames for texture classification and segmentation is presented. 
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Chapter 3 

TEXTURE CLASSIFICATION AND SEGMENTATION 

3.1. TEXTURE CLASSIFICATION 

Texture classification proceeds in two phases: characterization - where features are extracted 
from the textured input image, and the classification - where the feature vectors of a given 
textured input image arc assigned to a class using a classification algorithm. 

3.1.1. Texture Characterization 

Here we arrange the output of the filter bank into N-component vector, where N is the number 

of subbands. 


=[s,(kj ) ... d x {k,l)] T 

And interpret the result of the analysis for a given spatial index (k, 1) as a local linear 
transformation of the input vector x (k, 1). A 2-D pyramidal angular wavelet transform with a 
depth I yields N = 1 + 3*1 such feature channels. The texture is then characterized by the set of 
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N first-order probability density functions /?(y ; ), i=l,—N. Alternatively; we can get a more 
compact representation in terms of the channel variances Var {y,}. The success of this method 
obviously depends on the judicious selection of the filter bank. In practice, the channel variances 
are estimated from the average sum of squares over a region of interest R of the given texture 
type. 

*,-*x I >.'<*.'> (3 ' 2) 

rr A (k,l)eR 

where #R denoted the number of pixels in R. We can get a better variance estimate by 
subtracting E {x} 2 in the lowpass channel. 

3.1.2. Texture Classification 

3.1. 2.1. Experimental Data Set 

We performed classification experiments using 26 Brodatz textures. The images taken are of 
size 256*256. For each texture, feature vectors v = (v,,v 2 ,...v„) were evaluated, using eqn (3.2) 

using a window of size w*w over non-overlapping sub regions. The different window sizes that 
we have taken are 8*8, 16*16 and 32*32. 

3.1 .2.2. Classification Algorithm 

The class conditional probability density functions were assumed to be multivariate Gaussian 
with mean vectors and covariance matrices (m k , C k ), (k = 1 , 2, ... . ,K), where K = 26 is the total 
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number of classes. Under such an assumption, the minimum error Bayes Classifier is equivalent 
to assigning a texture sample with feature vector v to the class with minimum distance value 

dv. (v) = (v - m k ) T Ck' ; (v - m k ) + log(det(C k )) (3. 3) 

For each pattern tested, the training was performed on the remaining set (leaving-one-out 
method) using the maximum likelihood estimates of the distribution parameters. 

3.2. TEXTURE SEGMENTATION 

Our next objective is to modify the previous wavelet based feature extraction method to make 
it suitable for texture segmentation. It proceeds in two steps: feature extraction - where features 
are extracted from the textured input image and the optimization stage - in this stage feature 
vectors are grouped into the homogenous segments by minimizing an appropriate quality 
measure. 

3.2.1. Feature Extraction 

Texture features provide a higher order description of the local image content. The 
segmentation algorithm, which is capable of accurately recognizing the patterns, relies on good 
features. By good features we mean that the patterns of the same class yield similar feature 
vectors. Features are extracted from filtered images. The texture features that are used for 
segmentation are local energy measures computed over a window centered on the current spatial 
location. The feature image E k (x, y) corresponding to the filtered imaged (x, y) is given by: 
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E t (x,y) = -rjr S|/.( fl .*)r (3.4) 

N (a,b)eW X) 

where W xy is an N*N window centered at the pixel with coordinates (x, y). The result of 
averaging is a “feature image” in which the gray level at each pixel is a measure of the texture 
present at the corresponding location of the original image. When computing the texture features 
for pixels near the image boundary we assume that the image is extended by its mirror image. 
The size, N, of the averaging window in eqn (3.4) is an important parameter. As the window 
size increases, the computed value of the features at a point is determined by the large 
neighborhood of the image pixels, so that it gives more reliable values. But in real texture 
segmentation problems, the region size and boundaries are unknown. The large window size can 
significantly overlap regions and blurs the boundaries between textured regions. So for accurate 
determination of texture boundary, the smaller window size is needed. Therefore, there is a 
trade-off between reliable measurement and texture boundary, using a proper window size. In 
our experiments, we used different averaging window sizes. 

3.2.2. Segmentation by Clustering in Feature Space 

Features obtained through filtering approach transform the texture segmentation problem into 
a decision problem in the feature space. Corresponding to each pixel, there is a feature vector in 

which the number of features equals to the number of filters used. These feature vectors are 

* 

placed in the feature space as points. If our texture features are capable of discriminating these 
categories then the patterns belonging to the same class will form a cluster in the feature space, 
which is compact and isolated from clusters corresponding to other texture categories. So each 
cluster represents a distinct class, and clusters of points represent different classes of patterns. 



Pattern clustering algorithms are ideal vehicles for recovering such clusters in the feature space. 
Standard K - means algorithm is used to recover the clusters from the feature space. 

3.2.2.I. K - means Algorithm 

The K - means algorithm for cluster-seeking is based on the minimization of a performance 
index which is defined as the sum of the squared distances from all points in a cluster domain to 
the cluster center. The K - means algorithm requires the number of clusters to be known 
beforehand. This algorithm consists of the following steps: 

Step 1. Determine or choose K initial cluster centers zi(l), Z 2 O), . . . , Zk(l). Here zi(l) is the 
texture features of the first cluster center during the first iteration. 

Step 2. At the Mi iteration each pixel x is assigned to one of the K clusters Ci(k), C 2 (k), . . . , 

Ci<(k), where Cj(k) denotes the set of pixels whose cluster center is Zj(k). x is assigned to cluster 
Cj(k) if: 


|| x ~ z j w|| < ! x ~ z /W|| (35) 

Vi,j = 1 , 2 , i*j 

Step 3. From the results of step2, compute the new cluster centers Zj(k + 1), j = 1, 2, . . , K, such 
that the sum of the squared distances from all points in Cj(k) to the new cluster center is 
minimized. In other words, the new cluster center Zj(k + 1) is computed so that the performance 


index 
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is minimized. The new cluster center which minimizes this is the sample mean of Cj(k). 
Therefore, the new cluster center is given by: 


Zj{k+\) 



£ *. 

xeCj (A) 


j - 1,2,..., AT 


(3.7) 


where N } is the number of pixels of the cluster Cj(k). 

Step 4. If Zj(k + 1) = Zj(k) for / = 1, 2, . . , K, the algorithm has converged and the procedure is 

* 

terminated. Otherwise, go to Step 2. 

The determination of the initial cluster centers plays a crucial part because the final clustering 
will depend on the initial cluster centers chosen. The initial cluster centers are chosen by first 
generating the histograms of the feature images and then determining the dominant peaks in the 
histogram. An overview of the texture segmentation method is shown in Fig. 16. 



Fig. 16. An overview of the texture segmentation method 









In chapter 4, the results of texture classification and segmentation, using the methods 


explained in this chapter, are presented. 



42 


Chapter 4 

RESULTS AND DISCUSSION 

In this chapter, the performance of the proposed Angular Wavelet Frames for texture 
classification and segmentation is presented. We have performed our experiments using three 
sets of filters, ‘db2’, ‘db4’ and ‘dblO’. For these three cases, the simulation results for texture 
classification on 26 Brodatz textures are given in section 4. 1 . Results of texture segmentation 
using mosaic images are presented in section 4.2. 

4.1. Performance of Angular Wavelet Frames for Texture Classification 

We performed classification experiments using 26 Brodatz textures. Each image taken is 
of size 256*256 with 256 gray levels. The sub images are obtained by filtering the texture 
images with 2D Angular Wavelet Frames. For each texture, feature vectors of length N, 
where N is the number of sub images were evaluated using Equation (3.2) with a window of 
size w*w over non-overlapping sub regions. For each texture, a total of (256*256)/(w*w) 
feature vectors were evaluated. Mean vectors and covariance matrices (mk, Ck), (k = 1, 2, . . . 

, K), where K = 26 is the total number of classes, of the feature images are calculated. A 
feature vector of a given texture is assigned to a class with minimum distance value in 
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Equation (3.3). For each pattern tested, the training was performed on the remaining set 
(leaving-one-out algorithm). 

The classification results using different sets of filters (‘db2’, ‘db4’ and ‘dblO’) at 
different levels (level 1 - 4 sub images, level 2 - 7 sub images and level 2 - 10 sub images) 
with a window of size 8*8 are summarized in Table 1 . For each texture a total of 1024 feature 
vectors are evaluated. The average classification rates of 26 textures, called the overall 
correct classification rate are listed in the last row of the table. From Table 1, the overall 
classification rate for all the three filters is poor at level 1. But there is much improvement in 
their performance at level 2 using 7 sub images and 10 sub images. The overall classification 
rate is over 94% for all the three filters. Between ‘db2’ and ‘db4’ the performance of ‘db4’ is 
better and among all the three filters, the overall classification rate is good using ‘dbl0\ 
which was due to the length of the filter. The classification results using windows of size 
16*16 and 32*32 are given in Table 2 and Table 3 respectively. There is a significant 
improvement in classification rate with the windows of size 16*16 and 32*32 even at level 1. 
With a window of size 16*16, we have 100% correct classification rate for many images at 
level 2 and 100% overall classification rate using 32*32 window at level 2 with 7 and 10 sub 
images respectively. The results are much better when we extract features using a window of 
larger size. This is because as the window size increases the feature values are determined by 
a larger number of pixels in the window. This gives more reliable values for classification. 
The pattern of misclassification, that is which texture image is classified into which other 
category also termed as ‘confusion matrix’ is shown in Table 4 for the case of ‘db2‘, level 1 - 
4 sub images and using a window of size 16*16. From Table 4 we find that classification 
errors occurred between textures pairs that are difficult to discriminate visually. These errors 
are minimized with higher length filters and by extracting features using a window of larger 
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size. Confusion matrix for the case of ‘dblO’ level 1 - 4 sub images, using a window of size 
32*32 is given in Table 5. 

A comparison of overall correct classification rates of different methods is shown in Table 
6. From Table 6 we see that AWF method not only outperform DWT, which was expected 
due to choice of translation invariant representation but also its performance is better 
compared to DWF. This is because AWF can capture the dominant angular component of 
textures. 

4.2. Simulations Results for Texture Segmentation 

We have used mosaic images of two, three and four regions for segmentation task in our 
experiments. The reason for using mosaic images is that the texture boundaries are precisely 
known. But, such simple boundaries usually do not occur in real scenes. Images are filtered at 
level 2 by 2D angular wavelet frames constructed using ‘dblO’. Features are extracted from 
the filtered images using the Equation (3.4). These features are clustered using K - means 
clustering algorithm to obtain the segmentation of the images. Segmentation results are 
presented for different window sizes. In Fig. 17 to Fig. 20, segmentation results for two 
regioned texture images are shown. In Fig. 18, segmentation of the original image using a 
window of size 5*5 is not good, the segmentation is good with 9*9 window. In Fig. 19, 
segmentation using 7*7 window is not satisfactory and it is good with 11*11 window. In Fig. 
21 and in Fig. 22 the segmentation results for a three regioned images are given. In Fig. 23, 
the segmentation results for a four regioned image are given. Using a larger window gives 
more reliable measurement but for accurate determination of texture boundary, the smaller 
window size is needed. Therefore, there is a trade-off between reliable measurement and 


texture boundary. 
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Table 1. Classification Rate using window of size 8*8 


Texture 

Level 1 - 4 sub images 

Level 2-7 sub images 

Level 2-10 sub images 

Name 

db2 

db4 

dblO 

db2 

db4 

dblO 

db2 

db4 

dblO 

Dll 

67.38 

74.02 

72.66 

94.92 

94.43 

95.41 

98 24 

98.93 

98.63 

D12 

57.42 

66.31 

70.31 

85.55 

88.57 

92.77 

91.02 

93.85 

96.68 

D15 

71.48 

78.32 

82.32 

89.06 

90.72 

93.07 

91.99 

94.34 

96.78 

D16 

80.86 

86.23 

88.48 

99.12 

98.83 

99.41 

99.71 

100 

99.71 

D17 

89.45 

95.31 

93.36 

98.14 

99.22 

99.22 

99.22 

99.80 

99.22 

D20 

90.33 

84.47 

75.98 

94.92 

95.02 

98.63 

97.95 

98.93 

99.90 

D21 

97.95 

97.95 

99.12 

99.22 

99.71 

99.90 

99.61 

99.80 

100 

D29 

63.77 

57.32 

67.48 

88.48 

89.16 

87.11 

95.41 

95.80 

96.00 

D34 

92.77 

94.82 

97.46 

96.00 

96.00 

98.54 

97.17 

98.24 

99.80 

D35 

73.44 

71.58 

72.36 

82.42 

86 33 

84.96 

86.91 

86.82 

86.33 

D37 

91.70 

91.89 

91.70 

97.75 

97.17 

97.66 

97.36 

98.73 

98.83 

D38 

77.05 

75.20 

81.15 

86.33 

83.69 

88.48 

90 43 

91.60 

92.97 

D49 

99.71 

100 

100 

100 

100 

100 

100 

100 

100 

D51 

67.48 

73.63 

76.27 

91.60 

91.70 

96.29 

93.46 

96.09 

98.34 

D52 

65.33 

73.14 

74.22 

98.93 

99.02 

99.12 

99.32 

99.71 

99.41 

D56 

68.26 

70.80 

83.50 

86.04 

91.31 

95.51 

93.46 

96.88 

99.22 

D66 

48.83 

59.57 

52.73 

95.61 

94.14 

95.70 

99.32 

99.51 

99.80 

D68 

88.38 

88.57 

90.63 

98.34 

98.05 

99.51 

98.83 

98.93 

99.90 

D74 

72.27 

78.61 

88.38 

94.34 

96.09 

96.88 

97.95 

99.22 

98.63 

D85 

87.99 

86.04 

82.71 

94.73 

93.75 

92.97 

96.19 

96.58 

96.88 

D87 

82.42 

84.18 

82.32 

97.85 

99.02 

99.71 

99.51 

100 

99.90 

D90 

84.57 

87.50 

89.16 

87.60 

87.40 

92.68 

90.53 

93.75 

96.29 

D101 

83.40 

89.06 

93.36 

95.41 

95.12 

95.90 

98.05 

98.73 

99.12 

D103 

97.66 

99.02 

97.95 

99.32 

99.51 

99.51 

97.07 

99.02 

98.63 

D109 

87.60 

84.47 

89.06 

97.95 

97.07 

98.63 

98.93 

99.41 

99.51 

Dill 

67.87 

76.66 

78.03 

95.21 

96.00 

97.46 

99.71 

99.71 

99.90 

Overall 

79.05 

81.72 

83.49 

94.03 

94.50 

95.96 

96.44 

97.48 

98.09 
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Table 2. Classification rate using window of size 16*16 


Texture 

Level 1 - 4 sub images 

Level 2-7 sub images 

Level 2-10 sub images 

Name 

db2 

db4 

dblO 

db2 

db4 

dblO 

db2 

db4 

dblO 

Dll 

92.58 

94.92 

96.48 

100 

99.61 

99.22 

100 

100 

100 

D12 

82.42 

83.59 

90.23 

98.05 

97.66 

99.22 

98 44 

100 

99.61 

D15 

82.81 

87.89 

91.41 

96.88 

97.27 

98.44 

98.05 

98.44 

99.61 

D16 

91.80 

94.14 

98.05 

100 

100 

100 

100 

100 

100 

D17 

98.83 

99.61 

99.61 

100 

100 

100 

100 

100 

100 

D20 

98.44 

97.66 

93.36 

99.61 

100 

100 

100 

100 

100 

D21 

100 

100 

100 

100 

100 

100 

100 

100 

100 

D29 

77.73 

79.30 

84.77 

100 

98.44 

98.83 

100 

100 

100 

D34 

98.83 

99.22 

100 

99.61 

98.83 

100 

100 

100 

100 

D35 

87.11 

92.19 

93.36 

94.92 

96.48 

96.88 

98.05 

98.44 

97.27 

D37 

96.09 

96.48 

95.31 

100 

98.44 

100 

99.22 

100 

100 

D38 

87.50 

85.55 

94.14 

98.44 

97.66 

98.83 

96.88 

97.66 

100 

D49 

100 

100 

100 

100 

100 

100 

100 

100 

100 

D51 

83.98 

87.11 

91.41 

97.27 

99.22 

99.61 

99.22 

99.61 

100 

D52 

85.55 

91.02 

90.23 

100 

100 

100 

100 

100 

100 

D56 

98.44 

96.88 

99.22 

100 

99.22 

99.61 

100 

100 

100 

D66 

72.27 

88.28 

79.69 

100 

100 

100 

100 

100 

100 

D68 

94.53 

94.92 

96.48 

100 

99.61 

100 

100 

100 

100 

D74 

98.05 

98.83 

99.22 

100 

100 

100 

100 

100 

100 

D85 

96.88 

98.05 

97.66 

99.61 

99.61 

96.09 

100 

100 

98.83 

D87 

98.05 

97.66 

98.44 

100 

100 

100 

100 

100 

100 

D90 

94.92 

96.09 

96.09 

97.27 

98.83 

99.22 

99.22 

99.22 

100 

D101 

99.61 

99.61 

99.61 

100 

100 

100 

100 

100 

100 

D103 

100 

100 

100 

100 

100 

100 

100 

100 

100 

D109 

93.75 

92.58 

97.27 

99.61 

100 

99.61 

100 

100 

100 

Dill 

94.53 

97.27 

97.27 

100 

100 

100 

100 

100 

100 

Overall 

92.49 

94.19 

95.36 

99.28 

99.26 

99.44 

99.58 

99.74 

99.82 
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Table 3. Classification rate using window of size 32*32 


Texture 

Level 1 - 4 sub images 

Level 2-7 sub images 

Level 2-10 sub images 

Name 

dl)2 

dl)4 

dblO 

tlb2 

dl)4 

(lb 10 

db2 

dl>4 

cl b 1 0 

Dll 

100 

100 

100 



M 

100 

100 

100 

D12 

93.75 

98.44 

98.44 



100 

100 

100 

100 

D15 

87.50 

90.63 

100 



100 

100 

100 

100 

D16 

98.44 

96.88 

100 

100 


100 

100 

100 

100 

D17 

100 

100 

100 



100 

100 

100 

100 

D20 

100 

100 

98.44 



100 

100 

100 

100 

D21 

100 

100 

100 



100 


100 

100 

D29 

93.75 

90.63 

93.75 



100 

100 

100 


D34 

100 

100 

100 



100 

100 

100 


D35 

96.88 

100 

100 



100 

100 



D37 

100 

100 

96.88 



100 

100 



D38 

98.44 

98.44 

98.44 



100 

100 


100 

D49 

100 

100 

100 


100 

100 

100 


100 

D51 

95.31 

96.88 

98.44 



100 

100 

100 

100 

D52 

98.44 

98.44 

98.44 


100 

100 

100 

100 

100 

D56 

100 

100 

100 



100 

100 

100 

100 

D66 

95.31 

100 

89.06 


100 

100 

100 

100 

100 

D68 

98.44 

98.44 

100 


100 

100 

100 

100 

100 

D74 

100 

100 

100 


100 

100 

100 

100 

100 

D85 

100 

100 

' 100 


100 

100 

100 

100 

100 

D87 

100 

100 

100 

100 


100 

100 

100 

100 

D90 

100 

100 

98.44 


100 

100 

100 

100 

100 

D101 

100 

100 

100 


I 

100 

100 

100 

100 

D103 

100 

100 

100 


■ 

100 

100 

100 

100 

D109 

96.88 

98.44 

100 


1 

100 

100 

100 

100 

Dill 

98.44 

100 

100 


■f 

100 

100 

100 

100 

Overall 

98.14 

98.74 

98.86 


100 

100 

100 

100 

100 















Table 4. Confusion matrix for the case of 'db2', level 1 using window of size 16*16 
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Overall Classification Rate = 92,49% 




Table 5. Confusion matrix for the case of 'dblO 1 , level 1 using window of size 32*32 
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Overall Classification Rate = 98,86% 



Table 6. Comparison of Overall Correct Classification Rates (window size = 32*32) 


Method 

Classification Rate 

Level 1 - 4 sub images 

Level 2 - 7 sub images 

DWT 

91.93% 

95.44% 

DWF 

96.48% 

98.83% 

AWF 

98.14% 

100% 


Note: Resuits printed in light letter is directly taken from Reference [10] for comparison 
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Figure, 17, (»( Original Image. Segmentation results using a window of size (b) 5*5 (c) 7*7 (d) 9*9 
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Fig. 20. (a) Original image. Segmentation results using window of size (b) 7*7 (c) 9*9 (d) 11*11 
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Chapter 5 

CONCLUSIONS AND FUTURE SCOPE 
5.1. CONCLUSIONS 

We proposed the method of constructing 2D Angular Wavelet Frames using circular and 
angular transformations. We have proved that the constructed 2D filters using the 
transformations of ID PRFB, along with their conjugates at the synthesis end constitute a PRFB. 
Our classification results demonstrate that AWF approach is a promising approach for texture 
characterization. We observe from Table 5, that the rate of misclassification is less for textures 
having visually dominant angular component. It suggests that the constructed AWF is well suited 
for textures having dominant angular component. 

The segmentation results show that the 2D AWF is capable of segmenting multi-texture 
image. The segmentation results are good for two and three regioned texture images. The regions 
of the texture image segmentation are uniform and there is no hole in the interior regions. For 
three and four regioned images, the boundary is not spatially accurate and for four regioned 
image there are holes in the interior regions. The results could be improved by using spatial 
relationship among the pixels. The segmentation of the feature images has been realized by 



applying k - means clustering method, where the number of regions are known before, and the 
initial cluster centers are to be chosen properly so that they lie close to the clusters in the feature 
space. So the method proposed is supervised. 

5.2. FUTURE SCOPE 

The proposed 2D AWF method can be modified to classify Rotation Invariant Textures by 
combining the filters for a given level and for all angles. This can also be used to classify Scale 
Invariant Textures by combining the filters for all the levels given an angle (orientation). 
Segmentation can be made unsupervised by using merging techniques or by modifying the 
standard k - means algorithm so that it can iteratively determine the number of clusters in the 
feature space. For a large number of regions in an image, the boundaries were not spatially 
accurate. This could be overcome by using some spatial relationship among the pixels. 
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